image view
Opening the Black Box of 3D Reconstruction Error Analysis with VECTOR
Fygenson, Racquel, Jawad, Kazi, Li, Isabel, Ayoub, Francois, Deen, Robert G., Davidoff, Scott, Moritz, Dominik, Hess-Flores, Mauricio
This is the author's version of the article that has been published in the proceedings of IEEE Visualization conference. The final version of this record is available at: xx.xxxx/TVCG.201x.xxxxxxx/ This metric also provides no visibility into how particular Reconstruction of 3D scenes from 2D images is a technical challenge images, lighting conditions, camera positions, or details of the that impacts domains from Earth and planetary sciences and morphology of the remote environment might interact to create inaccuracies space exploration to augmented and virtual reality. The impact of these unknowns algorithms first identify common features across images compounds in domains where high accuracy terrain reconstruction and then minimize reconstruction errors after estimating the is critical to outcomes, like science or space exploration where there shape of the terrain. This bundle adjustment (BA) step optimizes is no ground truth and inaccurate reconstruction can lead to false around a single, simplifying scalar value that obfuscates many possible results or risking billion-dollar spacecraft.
Semi-Supervised Learning of Visual Features by Non-Parametrically Predicting View Assignments with Support Samples
Assran, Mahmoud, Caron, Mathilde, Misra, Ishan, Bojanowski, Piotr, Joulin, Armand, Ballas, Nicolas, Rabbat, Michael
This paper proposes a novel method of learning by predicting view assignments with support samples (PAWS). The method trains a model to minimize a consistency loss, which ensures that different views of the same unlabeled instance are assigned similar pseudo-labels. The pseudo-labels are generated non-parametrically, by comparing the representations of the image views to those of a set of randomly sampled labeled images. The distance between the view representations and labeled representations is used to provide a weighting over class labels, which we interpret as a soft pseudo-label. By non-parametrically incorporating labeled samples in this way, PAWS extends the distance-metric loss used in self-supervised methods such as BYOL and SwAV to the semi-supervised setting. Despite the simplicity of the approach, PAWS outperforms other semi-supervised methods across architectures, setting a new state-of-the-art for a ResNet-50 on ImageNet trained with either 10% or 1% of the labels, reaching 75.5% and 66.5% top-1 respectively. PAWS requires 4x to 12x less training than the previous best methods.
A Convolutional Architecture for 3D Model Embedding
Labrada, Arniel, Bustos, Benjamin, Sipiran, Ivan
During the last years, many advances have been made in tasks like3D model retrieval, 3D model classification, and 3D model segmentation.The typical 3D representations such as point clouds, voxels, and poly-gon meshes are mostly suitable for rendering purposes, while their use forcognitive processes (retrieval, classification, segmentation) is limited dueto their high redundancy and complexity. We propose a deep learningarchitecture to handle 3D models as an input. We combine this architec-ture with other standard architectures like Convolutional Neural Networksand autoencoders for computing 3D model embeddings. Our goal is torepresent a 3D model as a vector with enough information to substitutethe 3D model for high-level tasks. Since this vector is a learned repre-sentation which tries to capture the relevant information of a 3D model,we show that the embedding representation conveys semantic informationthat helps to deal with the similarity assessment of 3D objects. Our ex-periments show the benefit of computing the embeddings of a 3D modeldata set and use them for effective 3D Model Retrieval.